Mapping-based Data Integration

Authors

  • Toralf Kirsten
  • Erhard Rahm
Abstract

Many bioinformatics applications require data from different sources to answer complex research questions. Integrating such highly diverse data is a major challenge in bioinformatics and is often much too laborious and error-prone for scientists. Traditional integration approaches such as data warehousing and mediators are often applicable, but they are time-consuming to develop and deploy, and hard to maintain when source schemas change. We introduce the BioFuice system [1] for interconnecting and integrating data from different autonomous sources. It is based on a decentralized, peer-to-peer-like infrastructure. We utilize instance-level (object-level) correspondences between different sources, which are often already available in the sources in the form of web links, e.g. based on accession ids. Moreover, such correspondences can be generated by applying tools, e.g. BLAST, to associate similar objects based on their DNA or protein sequence similarity. Sets of such correspondences represent mappings between sources, which describe objects of different types, such as genes, proteins, and their functions. The object types and their corresponding mappings form the so-called source mapping model. Each mapping is also assigned a semantic mapping type. Together with the object types, these mapping types reflect the semantics of the domain within a so-called domain model. The domain model can be used to categorize sources and mappings so that they can be selected and accessed according to application requirements. To process queries and mappings we have devised a set of high-level operators. They can be used within script programs (workflows) to combine and analyze data from different sources. Furthermore, BioFuice provides a graphical user interface for explorative analysis and keyword search, which automatically generates script programs from interactively specified queries.
Currently, BioFuice integrates data from more than 20 public molecular-biological sources and ontologies, such as Ensembl, GeneOntology, and HomoloGene, as well as private sources holding results of previous analyses or preferences. The integration approach is applied in various collaborative research projects, ranging from the analysis of microarray data (IZBI) and of protein interaction networks (MPI MIS) to the detection of non-coding RNAs and gene homologues (BioInf).
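
The idea of mappings as sets of instance-level correspondences can be illustrated with a minimal Python sketch. It is not BioFuice's actual API or operator set: the class names (`Obj`, `Mapping`), the functions (`traverse`, `compose`), the source names, the mapping types, and the accession ids are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Obj:
    """An object in a source, identified by its accession id (hypothetical model)."""
    source: str
    obj_type: str
    accession: str


@dataclass
class Mapping:
    """A set of instance-level correspondences with a semantic mapping type."""
    name: str
    mapping_type: str
    pairs: set = field(default_factory=set)  # set of (Obj, Obj) tuples


def traverse(objects, mapping):
    """Return all objects that `mapping` associates with any of `objects`."""
    return {target for (origin, target) in mapping.pairs if origin in objects}


def compose(first, second, name, mapping_type):
    """Derive a new mapping by joining two mappings on their shared objects."""
    targets_by_origin = {}
    for origin, target in second.pairs:
        targets_by_origin.setdefault(origin, set()).add(target)
    pairs = {
        (a, c)
        for (a, b) in first.pairs
        for c in targets_by_origin.get(b, ())
    }
    return Mapping(name, mapping_type, pairs)


# Made-up objects in three made-up sources.
g1 = Obj("GeneSource", "Gene", "G1")
g2 = Obj("GeneSource", "Gene", "G2")
p1 = Obj("ProteinSource", "Protein", "P1")
p2 = Obj("ProteinSource", "Protein", "P2")
f1 = Obj("FunctionSource", "Function", "F1")

# Two mappings, each a set of correspondences between two sources.
gene_to_protein = Mapping(
    "GeneSource.ProteinSource", "encodes", {(g1, p1), (g2, p2)}
)
protein_to_function = Mapping(
    "ProteinSource.FunctionSource", "has_function", {(p1, f1)}
)

# Combine both mappings to relate genes directly to functions.
gene_to_function = compose(
    gene_to_protein,
    protein_to_function,
    "GeneSource.FunctionSource",
    "annotated_with",
)
```

In this sketch, `traverse({g1}, gene_to_protein)` yields the protein linked to gene `G1`, and `compose` joins the two mappings so that genes can be related to functions without a direct mapping between those two sources. Storing correspondences as plain sets of pairs keeps both operations simple; a real system would presumably also need source access, identifier resolution, and query handling.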

Similar resources

Integration of Remote Sensing and the GIS-based Methods for Provision of Cadastral Mapping of Agricultural Areas of Ardakan City

In the fifth development plan, establishment of the Cadastre System of agriculture nationwide has been defined as the work priority of institutions and organizations responsible in the area of agriculture and equity issuance in the country. In this study, the possibility of provision of the Cadastral mapping of agriculture by an integration of the data of the remote sensing and ...

Comparing the Effects of Concept Mapping and Integration Method on Nursing Students' Learning in Nursing Process Course in Tabriz University of Medical Sciences

Introduction: To analyze patients' problems and make an appropriate care plan, nursing students need a deep and meaningful learning. Therefore, it is better to choose educational methods which are capable of educating nursing students in such learning level. The aim of this study was to compare the effect of concept mapping and integration model on nursing students' learning in nursing process ...

Comparison of various knowledge-driven and logistic-based mineral prospectivity methods to generate Cu and Au exploration targets Case study: Feyz-Abad area (North of Lut block, NE Iran)

Motivated by the recent successful results of using GIS modeling in a variety of problems related to the geosciences, some knowledge-based methods were applied to a regional-scale mapping of the mineral potential, specifically for Cu-Au mineralization in the Feyz-Abad area located in the NE of Iran. Mineral Prospectivity Mapping (MPM) is a multi-step process that ranks a promising target area for mo...

Challenges and Opportunities for Online Freight Data Mapping Integration and Visualization

This paper presents the issues surrounding the integration and visualization of freight data using internet-based mapping applications. In relation to Internet-based mapping technology in freight data collection and planning we: (a) address implementation issues associated with data integration, (b) present a system architecture to leverage existing publicly available interfaces and web applica...

A Query Driven Method of Mapping from Global Ontology to Local Ontology in Ontology-based Data Integration

At present, mediator/wrapper integration methods are widely used in ontology-based data integration because they solve the data update problems of the data warehouse method. The key to this method is building the mapping from the global ontology in the mediator to the local ontology in the wrapper. This article analyzes the general mapping methods and designs a SPARQL query driven Global Local as View (...

Publication date: 2006